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Invoke URL checker. 
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Scan Document to locate hypertext 
links including a URL. 
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For each located hypertext link, do: 
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Determine context terms in predetermined 
vicinity of located hypertext link. 



Issue HTTP Get request to access 
web page referenced in hypertext link 
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Scan received web page to determine 
instances of context terms in web page. 
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Does 
web page include 
sufficient number of instances of 
intext terms to satisfy qualify: 
threshold? 
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Goto block 150 in FIG. 3 
to seek correct URL. 
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Go to next hypertext link in document 
until all hypertext links considered. 
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For each top level domain (TLD) different 
from TLD domain in URL of hypertext link, 

generate a modified URL using each 
different TLD with domain name in URL. 




Add each modified URL with different 
TLD to URL variation list. 
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Parse domain name as one word or compound 
words and spell check the parsed term(s) to 
generate possible correct spelling of the term and/ 
or compound terms. 
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Generate set of possible correct spellings of 
domain name, which may include different 
combinations of generated possible correct 
spelling of compound terms. 



Generate a modified URL for each 
possible correct spelling of domain 
name in the generated set. 



Append each modified URL generated with 
possible correct spelling to URL variation list. 
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Perform stemming algorithm on URL 

domain name to determine 
morphological variations of domain 
name and the compound terms that 
form the domain name. 



Generate a modified URL for each 
determined morphological variation 
of domain name or combination of 
variations of compound terms that 
form the domain name. 
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Append each modified URL 
generated with morphological 
variation of domain name or 
compound terms forming domain 
name to URL variation list. 



Go to block 200 
in FIG. 4. 
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For each modified URL / in 
URL variation list, do: 



200 




Submit HTTP GET request 
to modified URL /. 
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Scan received web page to 

determine instances of 
context terms in web page. 
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Does 
'web page include" 
sufficient number of 
.instances of context terms^ 
40 satisfy qualifying 
threshold^ 



No 



No 



Yes 




Append modified URL / to 
possible correct URL list. 
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Go back to block 200 until all URLs in 
URL variation list considered. 



Goto block 116 in FIG. 2 
for further hypertext links. 
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The first president of the United States was George 
Washington. President Washington, together with city planner 
Pierre L"Enfant, chose the present location of the White House, 
which is now 1 600 Pennsylvania Avenue. The White House 
has a long and fascinating hist ory. For more information on the 
White House, I suggest you go MMM 
is the official government web site of the White House. ^302 



